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19 Abstract: 

20 Infectious bronchitis virus (IBV) infects ciliated epithelial cells in the chicken respiratory tract. 

21 While some IBV strains replicate locally, others can disseminate to various organs, including 

22 the kidney. Here we elucidate the determinants for kidney tropism by studying interactions 

23 between the receptor binding domain (RBD) of the viral attachment protein spike from two 

24 IBV strains with different tropisms. Recombinantly produced RBDs from the 

25 nephropathogenic IBV strain QX and from the non-nephropathogenic strain M41 bound to 

26 the epithelial cells of the trachea. In contrast, only QX-RBD binds more extensively to cells of 

27 the digestive tract, urogenital tract, and kidneys. While removal of sialic acids from tissues 

28 prevented binding of all proteins to all tissues, binding of QX-RBD to trachea and kidney 

29 could not be blocked by pre-incubation with synthetic alpha-2,3-linked sialic acids. The lack 

30 of binding of QX-RBD to a previously identified IBV-M41 receptor was confirmed by ELISA, 

31 demonstrating that tissue binding of QX-RBD is dependent on a different sialylated glycan 

32 receptor. Using chimeric RBD proteins, we discovered that the region encompassing amino 

33 acids 99-159 of QX-RBD was required to establish kidney binding. In particular, QX-RBD 

34 amino acids 110-112 (KIP) were sufficient to render IBV-M41 with the ability to bind to 

35 kidney, while the reciprocal mutations in IBV-QX abolished kidney binding completely. 

36 Structural analysis of both RBDs suggests that the receptor binding site for QX is located at a 

37 different location on the spike than that of M41. 

38 

39 Importance: 

40 Infectious bronchitis virus is the causative agent of Infectious bronchitis in chickens. Upon 

41 infection of chicken flocks, the poultry industry faces substantial economic losses by 

42 diminished egg quality and increased morbidity and mortality of infected animals. While all 

43 IBV strains infect the chicken respiratory tract via the ciliated epithelial layer of the trachea, 

44 some strains can also replicate in the kidneys, dividing IBV in two pathotypes: non- 

45 nephropathogenic (example IBV-M41) and nephropathogenic viruses (including IBV-QX). 

46 Here we set out to identify the determinants for the extended nephropathogenic tropism of 
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47 IBV-QX. Our data reveal that each pathotype makes use of a different sialylated glycan 

48 ligand, with binding sites on opposite sides of the attachment protein. This knowledge should 

49 facilitate the design of antivirals to prevent coronavirus infections in the field. 

50 

51 Introduction: 

52 Infectious bronchitis is a disease in chickens caused by infectious bronchitis virus (IBV). In 

53 the poultry industry, infection of chicken flocks with IBV causes economic losses by reducing 

54 egg quantity and quality. In addition, animals become more susceptible to secondary 

55 bacterial infections like E. coli (1). The severity of disease and organs affected depends 

56 primarily on the IBV strain (2). Phylogenetic classification of IBV strains results in 32 

57 phylogenetic lineages (G-l 1-27, G-II-GVI) (3), of which GI-1 includes historically the first IBV 

58 genotype identified, Massachusetts (IBV-Mass). IBV-Mass infections are reported worldwide, 

59 and in Europe GI-1 is currently the 3 rd most prevalent genotype (2). The more prevalent IBV 

60 genotype circulating in Europe is IBV-QX (GI-19) (2, 3), which has been reported to cause 

61 kidney disease in contrast to IBV-Mass (2). 

62 

63 IBV primarily infects the respiratory tract, where the virus can bind and infect the ciliated 

64 epithelial lining of the trachea (4, 5). Upon infection of IBV clinical symptoms such as 

65 snicking, wheezing and/or nasal discharge are reported (6). While infection of IBV-Mass (of 

66 which strain M41 is the prototype) is predominantly detected in the upper respiratory tract (7) 

67 including the trachea (2), replication of IBV-QX is additionally found in the kidneys (7-9), 

68 oviduct and the gastrointestinal tract (10, 11), leading to additional clinical symptoms like 

69 swollen proventriculus (12) and reduction of egg production (13, 14). Because of these 

70 additional clinical symptoms, IBV-QX is described as a nephropathogenic IBV strain (2). 

71 

72 Binding to host tissues is the first step in the viral life cycle of IBV and therefore a critical 

73 factor in determining tissue tropism. Tissue tropism differs based on the amino acid 

74 composition of the spike protein as shown by recombinantly produced proteins (15-17) and 
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75 infection assays with recombinant viruses (18). The spike of IBV is post-translationally 

76 cleaved into two subunits, SI and S2, where S2 is anchored in the virus membrane and 

77 important for membrane fusion. SI comprises the head domain of spike and is responsible 

78 for host receptor binding (19). Using recombinantly expressed M41-S1 proteins, alpha-2,3- 

79 linked sialic acids were identified as the IBV receptor on a glycan array, where specific 

80 binding to the ligand Neu5Aca2-3Gaipi-3GlcNAc was observed (19). Recently the cryo-EM 

81 structure of the M41 spike has been resolved (20), indicating that the SI subunit consists of 

82 two independent folding domains, NTD (amino acids 21-237) and CTD (amino acids 269- 

83 414), with a proposed receptor binding site in both domains. Experimental evidence using 

84 recombinantly expressed spike domains have indicated that amino acids 19-272 of the M41 

85 spike are sufficient for binding to trachea as well as binding to alpha-2,3-linked sialic acids 

86 (15). This domain thus contains a receptor binding domain (RBD) and can be used to study 

87 biological implications of genetic variation in circulating IBV genotypes. 

88 

89 In this study we set out to identify how genetic variations in IBV spike proteins have 

90 contributed to different host tropisms. We demonstrate that QX-RBD binding to trachea and 

91 kidney is dependent on a different sialylated glycan ligand compared to M41-RBD. In 

92 particular, introduction of amino acids 110-112 (KIP) of the QX spike into M41-RBD was 

93 sufficient to extend its tropism toward the kidney. Previous docking experiments (17)and 

94 structural analysis suggest that the binding pockets for the different glycans are located at 

95 opposite sites of each spike protein. 

96 

97 Results: 

98 The N-terminal domain of IBV-QX spike contains a receptor binding domain. 

99 Eighty-five percent of the amino acids between the sequences of the first 257 amino acids of 

100 IBV-QX and IBV-M41 are either identical or similar. Here we set out to determine which of 

101 the dissimilar amino acids are the determinants for the difference in tissue tropism. 
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102 

103 In previous work we demonstrated the M41-RBD was sufficient to bind chicken trachea (15). 

104 To verify that no additional sites are present in M41 that could bind kidney or trachea tissue, 

105 we produced recombinant proteins consisting of the full ectodomain (ED), the SI portion of 

106 the ED, the RBD (NTD of SI), and the CTD of SI. Each protein was assessed for binding 

107 using trachea and kidney tissue slides. Binding to trachea tissue was observed using M41- 

108 ED, SI and RBD but not CTD to ciliated epithelium of the trachea, specifically located at the 

109 base of the cilia (Fig. 1), confirming previous observations (15, 19, 21). None of the proteins 

110 bound kidney tissue, which is shown by a representative picture using M41 -RBD (Fig 1B). 

111 Binding affinity to the known ligand (Neu5Aca2-3Galp1-3GlcNAc) in ELISA was observed 

112 using M41-RBD, -SI and -ED, not significantly different when compared to each other, but 

113 significantly higher compared to M41-CTD and TCoV-SI (Fig. 1C). These results indicate 

114 ligand binding of M41-RBD is not significantly different compared to M41-S1 and -ED, 

115 suggesting no additional ligand binding motifs are present in SI and ED, thus, in the 

116 remaining experiments we used M41-RBD as the tissue tropism of the virus is reflected using 

117 this recombinant protein. 

118 Amino acid alignment of the mature protein sequence of the receptor binding domain (RBD) 

119 of M41 and a comparable size fragment of the QX spike displayed a sequence identity of 

120 73.6% (Fig. 2A), with highest sequence diversity between amino acids 37-60 and 98-115. 

121 These regions include the previously described hyper variable regions (HVRs, highlighted in 

122 grey) of M41-S1 (22). Before studying whether sequence diversity between the RBDs of M41 

123 and QX contributes to the reported broader tropism of QX in vivo we first determined if the 

124 potential RBD of QX behaved like that of M41 (Fig. 1) and that it contains a receptor binding 

125 domain (15). Both proteins were produced as soluble recombinant protein in mammalian 

126 cells and analyzed on western blot after purification. Before loading, a fraction was pre- 

127 treated with PNGaseF to remove post-translational glycosylation. QX- and M41-RBD 

128 migrated comparable at around 55kDa (including glycosylation) and had a backbone of 
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129 around 32kDa as expected after PNGaseF treatment (Fig. 2B). Circular dichroism (CD) 

130 spectroscopy was used to assess similarities in secondary structure between M41- and QX- 

131 RBD. Spectra at all temperatures followed the same curve, and both proteins had similar 

132 broad melting curves, indicating that both proteins are equally stable (data not shown). 

133 Subsequent secondary structure calculations using Dichroweb (23) presented that M41- and 

134 QX-RBD contain 29 and 25% a-helix, 16 and 17% (3-strands, and 55 and 58% random 

135 structures, respectively (Fig. 2C). Finally, we confirmed that the QX-RBD was biologically 

136 active by applying it to chicken trachea tissue slides in protein histochemistry. We observed 

137 clear binding to the ciliated lining of epithelial cells and structures present in the kidney (Fig. 

138 2D) indicating that QX-RBD, like M41-RBD, contains a receptor binding site. 

139 QX-RBD shows a broader tissue tropism than M41-RBD. 

140 Next, we used M41- and QX-RBDs to study the distribution of host attachment factors across 

141 chicken tissues. To this end, we allowed both proteins to bind to tissue microarray slides 

142 containing 28 different chicken tissues (24). Binding of M41-RBD was primarily found on the 

143 ciliated lining of the epithelium of the proximal and distal trachea (Fig. 1), but additional 

144 staining was observed in the epithelial lining of the colon, cecal tonsil, ureter, oviduct, and 

145 conjunctiva (Table 1). QX-RBD bound to the same tissues as M41-RBD, but additional 

146 binding was observed in gizzard, ileum, and cloaca of the digestive tract, as well as liver and 

147 kidneys (Table 1 and Fig. 2D), reflecting that observed in vivo for replication of both 

148 genotypes. Detailed analysis of staining present in the kidney showed that binding of QX- 

149 RBD was restricted to the parietal epithelium of Bowmans capsule in the glomerulus (Fig. 

150 2D). No binding to the glomeruli was observed when using M41-RBD in three independent 

151 experiments using different protein batches. Taken together, QX-RBD shows a markedly 

152 broader binding profile than M41-RBD, which is in line with the reported broader tissue 

153 tropism in vivo (2). 

154 QX-RBD binds to sialic acids on chicken tissues. 
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155 To investigate whether the expanded tropism of QX-RBD can be explained by binding with 

156 similar specificity, but higher affinity, to the previously identified M41 receptor (19), we pre- 

157 incubated both RBD proteins with the synthetic Neu5Aca2-3Galp1-3GlcNAc before applying 

158 them to trachea and kidney tissue slides. As expected, binding of M41-RBD to the trachea 

159 was completely prevented (Fig. 3A, middle column) in the presence of the synthetic M41 

160 ligand. In contrast, QX-RBD still showed strong binding to the ciliated epithelium of the 

161 trachea and glomeruli of the kidney. To confirm the loss of binding of QX-RBD to Neu5Aca2- 

162 3Galp1-3GlcNAc a solid-phase ELISA was performed, in which Neu5Aca2-3Galp1-3GlcNAc 

163 was coated. As expected, no binding of QX-RBD to this particular glycan was observed at 

164 any of the protein concentrations, comparable to that of the negative control TCoV-SI (only 

165 binding longer branched galactose terminated glycans (25)), while M41-RBD bound to 

166 Neu5Aca2-3Gaipi-3GlcNAc in a concentration-dependent manner (Fig. 3B). 

167 To reveal whether QX-RBD exclusively depends on sialic acids, trachea and kidney tissue 

168 slides were pre-treated with Arthrobacter ureafaciens neuraminidase (AUNA) before applying 

169 M41- and QX-RBD. Removal of sialic acids from trachea and kidney tissue completely 

170 prevented binding of both RBD proteins (Fig. 3A, right column), indicating that QX-RBD 

171 binding is dependent on the presence of sialic acids on host tissues. 

172 M41-RBD gains kidney binding upon MLQ107-109KIP mutation. 

173 To gain in depth knowledge on the interaction of the IBV-RBD proteins and chicken tissue, 

174 we set out to determine the critical amino acids of viral spike proteins involved in binding to 

175 these glycan receptors, thereby leading to the ability to bind to kidney tissue. Chimeric RBD 

176 proteins were generated by dividing each wildtype RBD into three domains and mixing them 

177 to get six different combinations (schematic representations in Fig. 4A). These chimeras 

178 were then applied to trachea and kidney tissue slides. Chimeric proteins containing amino 

179 acids 98-156 (middle domain) of M41 (M-M-Q, Q-M-M, and Q-M-Q) demonstrated reduced 

180 binding to trachea and no detectable binding to kidney (Fig. 4B). In contrast, chimeric 
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181 proteins containing this region of QX (Q-Q-M, M-Q-Q, and M-Q-M) had comparable binding 

182 to tissues as QX-RBD. In particular, strong binding to the ciliated epithelial lining of trachea 

183 and specific staining in Bowmans capsule in the glomerulus was observed (Fig. 4B). Like 

184 wild type RBDs, binding of all chimeric proteins was dependent on the presence of sialic 

185 acids, as pre-treatment of host tissues with AUNA abrogated binding (data not shown). M-M- 

186 Q, Q-M-M and Q-M-Q proteins had reduced affinity for Neu5Aca2-3Gaipi-3GlcNAc (Fig. 

187 4C), potentially explaining the reduced staining of these proteins to trachea tissue (Fig. 4B). 

188 None of the RBD proteins containing the middle QX sequence (Q-Q-M, M-Q-Q, and M-Q-M) 

189 had affinity for this glycan in the ELISA as expected based on tissue staining (Fig. 4B and C), 

190 which is in line with the hypothesis these proteins are dependent on binding to the QX 

191 receptor instead of the known M41 receptor. These results indicate that the receptor binding 

192 site responsible for recognition of the QX receptor is determined by amino acids 99-159 of 

193 the spike. 

194 To ultimately determine the critical residues of the RBD for the interaction with chicken 

195 kidney tissue, additional chimeric proteins were produced and used in protein histochemistry. 

196 We exchanged two triplets (highlighted in dark green in Fig. 2A) of amino acids in FIVR 2 

197 (amino acids 99-115 of M41), either alone or in combination, that had the high diversity in 

198 amino acid characteristics (schematic representations Fig. 5A). Introduction of the M41 

199 sequence in the QX-RBD protein, SGS100-102Y (QX-Y) and KIP110-112MLQ (QX-MLQ) 

200 and their combination (QX-Y-MLQ), all resulted in a loss of binding to trachea and kidney 

201 tissues (Fig. 5B, right panel). In contrast, introduction of MLQ107-109KIP into M41-RBD 

202 (M41-KIP) resulted in gain of binding to glomeruli in kidney, both in a wild type background 

203 and in the Y99SGS (M41-SGS) mutant (Fig. 5B, left panel). In the ELISA both M41-SGS and 

204 M41-KIP demonstrated a decreased affinity for alpha-2,3-linked sialic acids compared to 

205 M41-RBD, while introduction of both triplets SGS and KIP (M41-SGS-KIP) completely 

206 abolished binding to this glycan (Fig. 5C). Taken together these results suggest that a 
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207 receptor binding site critical to establish kidney binding requires amino acids KIP at position 

208 107-109 in M41-RBD. 

209 Receptor binding site of the QX specific receptor differs from that proposed for M41. 

210 Finally, we modelled QX-RBD based on a structural overlay with the recently resolved cryo- 

211 EM structure of M41 spike (20), and focused on the amino acids allowing kidney binding. The 

212 overall structure of both proteins is comparable (Fig. 6A, green ribbon M41, blue ribbon QX), 

213 however the loop consisting of HVR 2 is slightly larger in QX-RBD as expected, as there are 

214 two additional amino acids present (Fig. 6A, SGS100-102 for QX-RBD versus Y99 in M41- 

215 RBD). Interestingly this loop was predicted to be involved in sugar binding (20), which we 

216 showed to be true for QX-RBD, but not for M41-RBD. In detail, the tyrosine (Y99) in the M41 

217 structure (Fig. 6A, beige) occupies more space than serine (S in QX) and can be seen 

218 reaching toward a neighboring loop. Furthermore the 110-112KIP sequence identified in QX- 

219 RBD (Fig. 6A, dark blue) places a positive charge at the protein surface which is not present 

220 in 107-109MLQ in M41-RBD (Fig. 6A light blue). Previous in silico docking analysis 

221 performed with potential alpha-2,3-linked ligands to the M41-RBD protein, identified amino 

222 acids S87, N144 and T162 to potentially be involved in receptor binding (17). When we 

223 highlighted these amino acids predicted to be involved in binding to alpha-2,3-linked sialic 

224 acids (Fig. 6B red spheres) and the amino acid triplicates involved in binding to the QX 

225 specific receptor (100-102 (SGS, yellow spheres) and 110-112 (KIP, dark blue spheres) in 

226 the overlayed RBD ribbon structure, we demonstrate that binding of the different ligands 

227 recognized by M41 and QX are on different sides of the protein (Fig. 6B). Furthermore, when 

228 these amino acids were highlighted in the full cryo-EM resolved structures of M41 (Fig. 6C) 

229 and QX (Fig. 6D), it clearly shows the potential ligand binding site of M41 is at a different 

230 location compared to the QX ligand binding site (Fig. 6C and D). 
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231 In conclusion we demonstrate that IBV-QX recognizes a sialylated glycan receptor present 

232 on chicken tissues that differs from that recognized by M41, and that this binding is likely 

233 required for the extended in vivo tissue tropism of the virus. 

234 Discussion 

235 In this study we reveal that nephropathogenic IBV-QX shows expanded binding tropism 

236 based on interactions with sialic acid(s) on chicken tissues that differs from the receptor 

237 elucidated for IBV-M41. Using chimeric proteins and in silico modeling, we conclude that 

238 amino acids in hyper variable region 2 are critical for recognizing such a sialylated glycan 

239 receptor. 

240 The N-terminal domain of IBV-QX spike protein comprises, like previously shown for M41 

241 (15), a receptor binding domain. Interestingly, QX-RBD shows no affinity for known ligand of 

242 M41 (Neu5Aca2-3Gaipi-3GlcNAc) in glycan ELISA, while it gained binding to a novel 

243 unidentified sialylated glycan receptor. Other avian gammacoronaviruses, including guinea 

244 fowl and turkey coronavirus are dependent on long glycans (linear or branched) capped with 

245 either an alpha-2,6-linked sialic acid (GfCoV only) or galactose ending glycans (both TCoV 

246 and GfCoV) (25, 26). Viruses of other coronavirus genera are dependent on sialic acid 

247 receptors, like alpha coronaviruses TGEV and PEDV (27, 28) and beta coronaviruses HCoV- 

248 OC43 and BCoV (29). Whether other nephropathogenic IBV strains are dependent on the 

249 same sialic acid receptors for binding and subsequent infection as QX, remains to be 

250 determined. 

251 

252 To elucidate the specific ligand used by QX-RBD we have performed several binding studies 

253 using previously developed glycan arrays (30, 31), containing multiple linear, and branched 

254 glycans capped without or with alpha-2,3-linked sialic acids or alpha-2,6-linked sialic acids. 

255 Unfortunately, no binding was observed using our RBD proteins. This may be explained by 

256 the usage of RBD proteins, instead of the full SI, like used previously (19) or the composition 
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257 and fine structure of the glycans present in both arrays. On the arrays used, most glycans 

258 contain the linkage found in mammals (Galpl ,4GlcNAc), while the minority contain 

259 Gaipi,3GlcNAc linkage. The exact nature of the receptor recognized by IBV-QX could be a 

260 more complex glycan containing a Gaipi ,3GlcNAc that is scarely populated on glycan 

261 arrays. 

262 

263 Comparison of the spikes of various IBV strains with reported nephropathogenicity, including 

264 IBV clade GI-14 (including strain B1648 (3)) and clade GI-13 (including strain 793B (3)), 

265 shows only nephropathogenic IBV clades contain an amino acid triplicate at position 1 GO- 

266 102, whereas in IBV Mass genotypes 99Y/H is expressed, thereby shortening HVR 2 with 

267 two amino acids. Sequence alignment of this amino acid triplicate (100-102 in QX-RBD) 

268 varies in nephropathogenic IBV genotypes from SGS/SGT for clade GI-19 (IBV-QX), 

269 NQQ/SQQ for clade GI-13 (IBV-793B) and SGAfor clade GI-14 (IBV-B1648) at that position. 

270 Furthermore, the amino acid triplet 110-112 KIP is not conserved across IBV genotypes. In 

271 these genotypes amino acid triplets LIQ for B1648 and MIP for 793B are present, which are 

272 sequence combinations of amino acids found in Mass (clade GI-1) and QX (clade GI-19). In 

273 terms of hydrophobicity and size, amino acid triplet MLQ (M41) is very similar to LIQ 

274 (B1648), whereas the proline (P) in KIP (QX) and MIP (793B) reduces the flexibility of the 

275 loop. 

276 

277 Structural analysis of the RBD of IBV suggests that the receptor binding sites for M41 and 

278 QX are positioned at different sides of the RBD (Fig. 6B, C and D). Previous in silico 

279 predictions of the interaction with alpha-2,3-linked sialic acid ligands in M41-RBD pointed 

280 towards three amino acids S87, N144 and T162, which are in close proximity four essential 

281 N-glycosylation sites (N33, N59, N85 and N160) (17). Although the amino acid sequence of 

282 S87, N144 and T162 is conserved between M41 and QX, one of the essential N- 

283 glycosylation sites is at a different position (N59 in M41, N58 in QX). This may result in a 

284 different conformation of the ligand binding site, thereby preventing in the QX-RBD wildtype 
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285 binding to the M41 ligand, which was supported by experimental evidence using chimeric 

286 M41-RBD protein where this glycosylation site was replaced, resulting in loss of binding to 

287 trachea tissue (data not shown). Furthermore, in the publication where the cryo-EM structure 

288 of M41 was resolved, the loop consisting of amino acids present in HVR 2 of the spike was 

289 proposed to be required for receptor binding (20). Our data points towards involvement of the 

290 unglycosylated loop containing HVR 2 for recognition of the QX glycan ligand but not the 

291 M41 ligand. Furthermore, the cryo-EM structure of the M41-CTD predicts other putative 

292 receptor-binding motif loops in M41 spike (20). In figure 1, we demonstrated no binding to 

293 trachea and kidney tissue was observed using our recombinantly expressed M41-CTD, in 

294 contrast to their published results. As binding of QX-RBD reflects the tissue tropism of QX 

295 infected birds, we speculate whether these loops (in the CTD) are necessary for initial 

296 receptor recognition and involved in QX infection. 

297 

298 In conclusion, we demonstrated that IBV-QX binding to chicken trachea and kidney is 

299 dependent on a sialylated glycan receptor and that amino acids in HVR 2 of the QX-RBD are 

300 critical for this receptor binding profile. This knowledge adds to our understanding of 

301 differences in tissue tropism between IBV strains in vivo and may contribute in designing new 

302 antivirals to prevent coronavirus infections in the field. 

303 

304 Materials and Methods: 

305 Construction of the expression plasmids: The expression plasmids containing the codon 

306 optimized M41-ED (amino acids 19-1091 (21)), M41-S1 (amino acids 19-532 (19)), M41- 

307 RBD (amino acids 19-272 (15)) and M41-CTD (amino acids 273-532 (15) accession number 

308 AY851295) sequence followed by a trimerization domain (GCN4) and strep-tag (ST) was 

309 described previously (15). The codon-optimized sequence of QX-RBD (amino acids 19-275, 

310 accession number AFJ11176), containing upstream Nhe\ and downstream Pad restriction 

311 sites, was obtained from GenScript and cloned into the pCD5 expression vector by restriction 

312 digestion, as previously described (19). Fragments to generate chimeric RBD proteins were 


D 

o 


Downloaded from http://jvi.asm.org/ on November 8, 2019 at UCSF LIBRARY 



cfWology JVI Journal of Virology Accepted MdnUSCiTpFPosted Oflllhe 


313 created by splice overlap extension PCR using primers in Table 2 and cloned into pJET 

314 vector (Thermo Scientific, USA). The sequences were verified by automated nucleotide 

315 sequencing (Macrogen, The Netherlands) before cloning each fragment into the pCD5 

316 expression vector. Mutations up to 9 nucleotides were introduced by site directed 

317 mutagenesis using primers listed in Table 2 and the sequences were subsequently verified 

318 by automated nucleotide sequencing (Macrogen, The Netherlands). 

319 Production of recombinant proteins: Recombinant RBD proteins were produced in human 

320 embryonic kidney (HEK293T) cells. In short, cells were transfected with pCD5 expression 

321 vectors using polyethylenimine (PEI) at a 1:12 (wt/wt) ratio. Cell culture supernatants were 

322 harvested after 6 days. The recombinant proteins were purified using Strep-Tactin 

323 Sepharose beads as previously described (19). Proteins were pretreated (where indicated) 

324 with PNGaseF (New England Biolabs, USA) according to manufacturer’s protocol before 

325 analysis by Western blot using Strep-tactin HRP antibody (IBA, Germany). 

326 Circular Dichroism (CD): Recombinant IBV-RBD proteins were exchanged into buffer 

327 containing 10 mM sodium phosphate, pH 7.75 and diluted to 0.06 mg/ml. CD spectra were 

328 collected on a JASCO J-810 spectropolarimeter with a Peltier thermostated fluorescence 

329 temperature controller module by accumulating 4 scans from 285-190 nm with a scanning 

330 speed of 10 nm/min, Digital Integrated Time 1 second, bandwidth 1 nm, and standard 

331 sensitivity at 25 °C. A thermal melt was done from 25 °C to 95 °C with a ramp rate of 1 °C per 

332 minute. A full CD scan was collected at 95 °C. After lowering the temperature to 25 °C, the 

333 protein was allowed to refold for 20 minutes at 25 °C, and a third CD scan was taken at 25 °C 

334 to measure recovery. Secondary structure calculations for the CD data collected at 25 °C 

335 before the thermal melt were processed by Dichroweb (23) using the CDSSTR (32), Selcon3 

336 (33), and Contill (34) algorithms with protein reference set 7. Results from the 3 algorithms 

337 were averaged and plotted in Fig. 2C. 

338 ELISA: Neu5Aca2-3Gal(31-3GlcNAc-PAA (Lectinity Holdings, Russia) was coated in a 96- 

339 well Nunc MaxiSorp plate (Sigma-Aldrich, Germany) at 0.5 pg/well overnight at 4 °C, 

340 followed by blocking with 3% bovine serum albumin (BSA; Sigma, Germany) in PBS-0.1% 
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341 Tween20 overnight. RBD proteins were pre-incubated with Strep-tactin HRP antibody (IBA, 

342 Germany) (1:200) for 30 min on ice. Indicated protein amounts were diluted in PBS and 

343 applied onto the coated well, followed by incubation for 2 h at room temperature. TMB 

344 (3,3=,5,5=-tetramethylbenzidine; Thermo Scientific, Netherlands) substrate was used to 

345 visualize binding, after which the reaction was terminated using 1 M H2S04. The optical 

346 density at 450 nm was measured in a FLUOstar Omega instrument (BMG Labtech), and 

347 MARS data analysis software was used for data analysis. Statistical analysis was performed 

348 using a two-way analysis of variance (ANOVA). 

349 Protein histochemistry: Protein histochemistry was performed as previously described 

350 (19). Recombinant proteins pre-complexed with Strep-tactin HRP antibody (IBA, Germany) 

351 were applied onto 4 pm sections of formalin-fixed paraffin-embedded healthy chicken tissues 

352 at 100 pg/ml (for RBD, SI and ED in equal molar amount) (24) and binding was visualized 

353 using 3-amino-9-ethyl-carbazole (AEC; Sigma-Aldrich, Germany). Where indicated RBD 

354 proteins were pre-incubated with 100 pg/ml Neu5Aca2-3Gaipi-3GlcNAc (Lectinity Holdings, 

355 Russia) for 30 min on ice, before application onto the tissues. Pre-treatment of tissues was 

356 performed using 2 mU of Neuraminidase (sialidase) from Arthrobacter ureafaciens (AUNA) 

357 (Sigma, Germany) in 10 mM potassium acetate and 2.5 mg/ml Triton X-100, pH 4.2, and 

358 incubated at 37 °C overnight, before protein application. 

359 
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467 Figure legends: 

468 FIG 1 Binding of M41-spike proteins to paraffin-embedded healthy chicken trachea and 

469 kidney tissue. A) Schematic representation of spike proteins, M41-ectodomain (ED, amino 

470 acids 1-1072), M41-S1 (amino acids 1-513), M41-RBD (amino acids 1-254) and M41-CTD 

471 (amino acids 254-513) followed by a trimerization domain (GCN4) and strep-tag (ST). 

472 Numbering starts at 1 of the mature protein sequence. B) Protein binding of M41 spike 

473 proteins observed in the trachea and kidney is visualized by red staining. C) Affinity of M41 

474 spike proteins for the known ligand (Neu5Aca2-3Galp1-3GlcNAc) in solid phase ELISA. 

475 Where at all protein amounts a significant difference of at least p< 0.01 was observed 

476 between M41-RBD, -SI, -ED and M41-CTD and TCoV-SI which served as a negative 

477 control tested in two-way ANOVA. Elisa was performed in triplicate where average and 

478 standard deviations are shown. 

479 

480 FIG 2 IBV M41- and QX-RBD protein analysis. A) Amino acid alignment of M41-RBD (amino 

481 acids 19-272, accession number AY851295) and amino acids 19-275 (accession number 

482 AFJ11176) of the QX spike. Numbering starts at 1 of the mature protein sequence (signal 

483 sequence not shown). Dots indicate identical amino acids. Grey highlights, surrounded by 

484 black box indicate previously identified hyper variable regions of IBV-Mass (22). Green 

485 highlights indicate very different residues. B) M41- and QX-RBD with and without pre- 

486 treatment of PNGaseF analyzed by Western blot using Strep-Tactin HRP antibody. C) 
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487 Percentage of secondary protein structures calculated based on CD analysis of M41- and 

488 QX-RBD. D) Binding of QX-RBD to paraffin-embedded healthy chicken trachea and kidney 

489 visualized by red staining in protein histochemistry. 

490 FIG 3 Avidity and affinity of M41- and QX-RBD for host factors. A) Protein histochemistry of 

491 M41- and QX-RBD onto paraffin-embedded chicken trachea and kidney tissue (left column), 

492 upon pre-incubation of proteins with Neu5Aca2-3Gaipi-3GlcNAc (middle column) or pre- 

493 treatment of tissues with Arthrobacter ureafaciens neuraminidase (right column). B) Affinity of 

494 RBD proteins for Neu5Aca2-3Galp1-3GlcNAc in ELISA where **= p< 0.01, ****= p<0.001 

495 tested in two-way ANOVA. TCoV-SI was used as a negative control in equal molar amounts. 

496 ELISA was performed in triplicate with all proteins; average is shown with standard 

497 deviations. 

498 FIG 4 Chimeric RBD protein binding to chicken tissues. A) Schematic representation of 

499 chimeric RBD proteins, grey box indicates M41- and white box QX wildtype sequence. Amino 

500 acids surrounding the transitions between the different domains of the chimeric proteins are 

501 indicated including the amino acid number of the wildtype sequence. B) Binding of chimeric 

502 RBDs to trachea and kidney tissue in protein histochemistry. C) Affinity of 37.5 nmol chimeric 

503 RBD proteins in ELISA for Neu5Aca2-3Galp1-3GlcNAc. Significant differences are *= 

504 p<0.05, **= p<0.01, ****= p<0.001, tested in two-way ANOVA. ELISA was performed with all 

505 chimeric RBD proteins in triplicates. 

506 FIG 5 Identification of amino acids involved in IBV kidney binding using chimeric proteins. A) 

507 Schematic representation of chimeric RBD proteins, grey box indicates M41- and white box 

508 QX wildtype sequence. Numbers above indicate the positions of the amino acid triplicates 

509 swapped between M41 and QX. B) Binding of chimeric RBDs to trachea and kidney tissue in 

510 protein histochemistry. C) Binding of chimeric RBD proteins (37.5nmol) in ELISA to 

511 Neu5Aca2-3Gaipi-3GlcNAc. Significant differences are *= p<0.05, **= p<0.01, tested in two- 

512 way ANOVA. ELISA was performed with all chimeric RBD proteins in triplicates. 
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513 FIG 6 Model of IBV spike with predicted receptor binding sites. A) Structural alignment 

514 overlay of QX-RBD (blue ribbon) onto M41-RBD (green ribbon), based on PDB entry 6cv0 

515 (20) using Swiss-model. Detailed representation of the receptor binding site identified for QX- 

516 RBD; indicated in yellow sticks are amino acids 100-102 SGS in QX-RBD and in beige 99 Y 

517 in M41-RBD. 110-112 KIP of QX-RBD is indicated with dark blue sticks, whereas light blue 

518 represents 107-109 MLQ in M41-RBD. B) Amino acids involved in receptor binding of IBV. 

519 Blue ribbon represents the modeled QX-RBD structure with amino acids 100-102 (SGS) as 

520 yellow spheres and 110-112 (KIP) as blue spheres. The green ribbon represents the M41- 

521 RBD with 99 (Y) as beige spheres and 107-109 (MLQ) as light blue spheres. Amino acids in 

522 red spheres (S87, N144 and T162) are previously predicted to be involved in alpha-2,3- 

523 linked sialic acid binding of M41-RBD (17). C) Surface representation of the trimeric M41 

524 spike cryo-EM structure (20). S2 is in dark gray for all monomers. SI is in light gray with one 

525 SI monomer colored bright green for the RBD domain and pale green for the CTD. Amino 

526 acids involved in ligand binding are highlighted: yellow is 99Y (100-102 SGS in QX), dark 

527 blue is 107-109MLQ (110-112 KIP in QX) and red is S87, N144 and T162. D) Modeled QX 

528 spike based on PDB entry 6cv0, colors as indicated in C, except the SI of QX is blue, and 

529 the RBD in bright blue. Representations on right of C and D are structures turned 90 degrees 

530 towards the viewer. All representations were made using PyMol viewer. 

531 TABLE 1 Relative binding of IBV-RBD proteins to paraffin-embedded healthy chicken tissues 

532 white, no visible staining; +- = staining of few cells; + = staining of most epithelial cells 

533 TABLE 2 Primer sequences to create chimeric RBD plasmids 

534 R= nt A/G K= nt G/T M=nt A/C Y=nt C/T W= nt A/T S= nt C/G 

535 Underlined sequences indicate nucleotides changed to introduce the mutation 

536 
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11! I E I Relative binding of IBV-RBD proteins to paraffin-embedded healthy chicken tissues 
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TABLE 2 Primer sequences to create chimeric RBD plasmids 



Original 

plasmid 

Forward primer 

Reverse primer 

MMQ 

M41-RBD 

gtcgcttccgtgctagca 

acaccagrtckccgttcag 


QX-RBD 

ctgaacggmgayctggtgt 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

MQM 

MQQ 

gtcgcttccgtgctagca 

acaccagrtckccgttcag 


M41-RBD 

ctgaacggmgayctggtgt 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

MQQ 

M41-RBD 

gtcgcttccgtgctagca 

tagcaatgwgtsacgaacactg 


QX-RBD 

cagtgttcgtsacwcattgcta 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

QQM 

QX-RBD 

gtcgcttccgtgctagca 

acaccagrtckccgttcag 


M41-RBD 

ctgaacggmgayctggtgt 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

QMQ 

QMM 

gtcgcttccgtgctagca 

acaccagrtckccgttcag 


QX-RBD 

ctgaacggmgayctggtgt 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

QMM 

QX-RBD 

gtcgcttccgtgctagca 

tagcaatgwgtsacgaacactg 


M41-RBD 

cagtgttcgtsacwcattgcta 

ctgcttcatgcgcttaattaa 



gtcgcttccgtgctagca 

ctgcttcatgcgcttaattaa 

M41-SGS 

M41-RBD 

atctgatggatgtcccatcacc 

ccqqa cttataqcaatqtqtca cq 

M41-KIP 

M41-RBD 

tcctaagaactttctgcgggtgtc 

atcttqccqqtqatqqqacatcc 

M41-SGS- 

KIP 

M-SGS 

tcctaagaactttctgcgggtgtc 

atcttqccqqtqatqqqacatcc 

QX-Y 

QX-RBD 

ctgtcccatcaccggcaag 

tacccggagctgtagcaatg 

QX-MLQ 

QX-RBD 

qcaqcaaqaccacatcaaaatttc 

aqcatqccqqtqatqqqacaaqa 

QX-Y- 

MLQ 

Q-Y 

qcaqcqaaaccacatcaqaatttc 

aqcatqccqqtqatqqqacaaqa 


R= nt A/G K= nt G/T M=ntA/C Y=nt C/T W= nt A/T S= nt C/G 
Underlined sequences indicate nucleotides changed to introduce the mutation 
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